Generating Word and Document Embeddings for Sentiment Analysis

نویسندگان

چکیده

AbstractSentiments of words can differ from one corpus to another. Inducing general sentiment lexicons for languages and using them cannot, in general, produce meaningful results different domains. In this paper, we combine contextual supervised information with the semantic representations occurring dictionary. Contexts help us capture domain-specific scores are indicative polarities those words. When features extracted their dictionary definitions, observe an increase success rates. We try out combinations contextual, supervised, dictionary-based approaches, generate original vectors. also word2vec approach hand-crafted features. induce sentimental vectors two corpora, which movie domain Twitter datasets Turkish. thereafter document employ support vector machines method utilising vectors, our approaches perform better than baseline studies Turkish a significant margin. evaluated models on English corpora as well these outperformed approach. It shows that cross-domain portable other languages.KeywordsSentiment analysisOpinion miningWord embeddingsMachine learning

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Document Embeddings for Arabic Sentiment Analysis

Research and industry are more and more focusing in finding automatically the polarity of an opinion regarding a specific subject or entity. Paragraph vector has been recently proposed to learn embeddings which are leveraged for English sentiment analysis. This paper focuses on Arabic sentiment analysis and investigates the use of paragraph vector within a machine learning techniques to determi...

متن کامل

Refining Word Embeddings for Sentiment Analysis

Word embeddings that can capture semantic and syntactic information from contexts have been extensively used for various natural language processing tasks. However, existing methods for learning contextbased word embeddings typically fail to capture sufficient sentiment information. This may result in words with similar vector representations having an opposite sentiment polarity (e.g., good an...

متن کامل

Sentiment analysis leveraging emotions and word embeddings

Sentiment analysis and opinion mining are valuable for extraction of useful subjective information out of text documents. These tasks have become of great importance, especially for business and marketing professionals, since online posted products and services reviews impact markets and consumers shifts. This work is motivated by the fact that automating retrieval and detection of sentiments e...

متن کامل

A Convolution Kernel for Sentiment Analysis using Word-Embeddings

Accurate analysis of a sentence’s sentiment requires understanding of how words interact to convey emotion. Current works use the sentence’s parse tree and recursively compute sentiment of the constituent phrases. This approach is expensive and requires a human to annotate all subtrees of a sentence. We examine how using a lexical similarity kernel can leverage word-embeddings, generated in an ...

متن کامل

Learning Bilingual Sentiment Word Embeddings for Cross-language Sentiment Classification

The sentiment classification performance relies on high-quality sentiment resources. However, these resources are imbalanced in different languages. Cross-language sentiment classification (CLSC) can leverage the rich resources in one language (source language) for sentiment classification in a resource-scarce language (target language). Bilingual embeddings could eliminate the semantic gap bet...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Lecture Notes in Computer Science

سال: 2023

ISSN: ['1611-3349', '0302-9743']

DOI: https://doi.org/10.1007/978-3-031-24340-0_23